Computational genome-wide cell- type-specific predictions of dimeric transcription factor complexes

نویسندگان

  • Izabela Makałowska
  • Jerzy Tiuryn
  • Krzysztof Jozwiak
  • Karolina Pajak
  • Anita Plazinska
  • Irving W. Wainer
چکیده

It is estimated that the human genome contains as many as 8 000–15 000 copies of protein coding genes generated by reverse transcription. Although majority of them are in the state of a “relaxed” selection and remain “dormant”, as they are lacking regulatory regions, many become functional. The evolutionary path of these functional retrogenes’ is not uniform. In the course of the evolution they may undergo subfuctionalization and consequently share the function with their parent or develop a brand new function (neofunctionalisation). A good number of studies was recently performed to explore these unique sequences, yet our knowledge about retrogenes evolution and function is exceptionally limited. We performed a comprehensive analysis of 62 animal genomes to identify retrocopies of protein coding genes, study their evolution and decipher putative functions. Our analyses revealed that the fraction of expressed retrocopies is much higher than previously estimated and that retrocopies provide genetic material for new proteins, microRNA genes as well as trans natural antisense transcripts (trans-NAT). In addition, they may srve as microRNA sponges and, in case of nested retrocopies, possibly act as transcriptional interference factor and cause premature termination of host gene transcription. Moreover, our studies demonstrated remarkable differences in retrocopies evolution between placental mammals and other animals. L4.2

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comprehensive prediction in 78 human cell lines reveals rigidity and compactness of transcription factor dimers.

The binding of transcription factors (TFs) to their specific motifs in genomic regulatory regions is commonly studied in isolation. However, in order to elucidate the mechanisms of transcriptional regulation, it is essential to determine which TFs bind DNA cooperatively as dimers and to infer the precise nature of these interactions. So far, only a small number of such dimeric complexes are kno...

متن کامل

Modeling transcription factor complex binding to eukaryotic genomes

Author's declaration: aware of legal responsibility I hereby declare that I have written this dissertation myself and all the contents of the dissertation have been obtained by legal means. Abstract The binding of transcription factors (TFs) to their specific motifs in genomic regulatory elements of eukaryotic organisms is commonly studied in isolation. However , in order to elucidate the mecha...

متن کامل

A predictive modeling approach for cell line-specific long-range regulatory interactions

Long range regulatory interactions among distal enhancers and target genes are important for tissue-specific gene expression. Genome-scale identification of these interactions in a cell line-specific manner, especially using the fewest possible datasets, is a significant challenge. We develop a novel computational approach, Regulatory Interaction Prediction for Promoters and Long-range Enhancer...

متن کامل

REPA: Applying Pathway Analysis to Genome-wide Transcription Factor Binding Data.

Pathway analysis has been extensively applied to aid in the interpretation of the results of genome-wide transcription profiling studies, and has been shown to successfully find associations between the biological phenomena under study and biological pathways. There are two widely used approaches of pathway analysis: over-representation analysis, and gene set analysis. Recently genome-wide tran...

متن کامل

Limb-Enhancer Genie: An accessible resource of accurate enhancer predictions in the developing limb

Epigenomic mapping of enhancer-associated chromatin modifications facilitates the genome-wide discovery of tissue-specific enhancers in vivo. However, reliance on single chromatin marks leads to high rates of false-positive predictions. More sophisticated, integrative methods have been described, but commonly suffer from limited accessibility to the resulting predictions and reduced biological ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014